IN THE CLAIMS 



1 .(Currently Amended) A method for creating a multimedia presentation 
having video frames and video segments with synchronized audio, said method 
comprising the steps of: 

(g) receiving audio-video data; 

(h) separating said audio-video data into an audio stream and a 
video sequence; 

(i) dividing said video sequence into video segments, each of said video 
segments comprising a group of slide frames; 

(j) for each of said video s e gm e nt segments ; 

(d1) calculating an audio significance measure using said audio 
stream related to each of said video segm e nt segments ; 

(d2) using at least said audio significance measure, selecting either of 
each said video s e gm e nt segments in its entirety or extracting at least one slide 
frame from said corresponding group of frames; 

(k) synchronizing said audio stream and satef selected video s e gm e nt 
segments and said at least one slide fram e s frame ; and 

(I) synchronously reproducing selected video segments and said at least 
one slide fram e s frame and said audio stream as said multimedia presentation. 

2. (Original) The method according to claim 1, wherein, in step (d1), said 
audio significance measure is calculated from a measure of activity within said 
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audio stream. 



3. (Original) The method according to claim 2, wherein said measure of 
activity is determined from a frequency domain representation of said audio 
stream. 

4. (Original) The method according to claim 2, wherein said measure of 
activity is calculated from one or more of power, peak-frequency and frequency 
spread. 

5. (Original) The method according to claim 4, wherein said measure of 
activity is calculated as power + peak frequency + spread. 

6. (Original) The method according to claim 4, wherein said measure of 
activity is calculated as 2 x power + peak frequency + 0.5 x spread. 

7. (Original) The method of claim 1, wherein, in step (d2), said selection 
is based upon a comparison of said audio significance measure with an activity 
threshold, such that if said threshold is exceeded, then said video segment is 
selected, else if not exceeded, said at least one slide frame is selected. 

8. (Currently Amended) A method for creating a multimedia presentation 
having video frames and video segments with synchronized audio, said method 



comprising the steps of: 

(g) receiving audio-video data; 

(h) separating said audio-video data into an audio stream and a video 
sequence; 

(i) dividing said video sequence into video segments, each of said video 
segments comprising a group of slide frames; 

(j) for each of said video s e gm e nt segments ; 

(d1 ) extracting at least one representative frame from the 
corresponding said group of frames; 

(d2) calculating a video significance measure using said the 
extracted at least one frame fram e s ; 

(d3) calculating an audio significance measure using said the 
audio stream related to said each of said video segment; 

(d4) using said video and audio significance measures, selecting 
either said each of the video s e gm e nt segments in its entirely or extracting at 
least one slide frame from said group of frames; 

(k) synchronizing said audio stream and said the selected video segments 

and the extracted said at least one slide fram e s frame ; and 
(I) synchronously reproducing said the selected video segment segm e nts 
and the extracted slide fram e s frame and said audio stream. 

9. (Original) The method according to claim 8, wherein, in step (d1), said 
audio significance measure is calculated from a measure of activity within said 



audio stream. 

10. (Original) The method according to claim 9, wherein said measure 
of activity is determined from a frequency domain representation of said audio 
stream. 

1 1 . (Original) The method according to claim 8, wherein said measure 
of activity is calculated from one or more of power, peak-frequency and 
frequency spread. 

12. (Original) The method according to claim 1 1, wherein said measure 
of activity is calculated as power + peak frequency + spread. 

13. (Original) The method according to claim 1 1, wherein said 
measure of activity is calculated as 2 x power + peak frequency + 0.5 x 
spread. 

14. (Original) The method according to claim 8, wherein said video 
significance measure is determined from a level of relative movement between 
said frames. 

15. (Original) The method according to claim 14, wherein said frames 
comprise objects and said level of relative movement is determined from a 



direction and magnitude of motion (u^vj of each object in said frames to derive 
an activity value of the frame. 



16. (Original) The method according to claim 15, wherein said activity 
value is determined from the standard deviation of the direction and magnitude 
(Uj,Vj) of each object. 

17. (Original) The method of claim 16, wherein said activity value is 
determined according to the expression: 



wherein (u,v) is the average direction and magnitude of motion of all objects in 
said frame. 

18. (Original) The method of claim 8, wherein, in step (d4), said 
selection is based upon a comparison of said audio significance measure with an 
activity threshold, such that if said threshold is exceeded, then said video 
segment is selected, else if not exceeded, said at least one slide frame is 
selected. 

19. (Original) The method of claim 18, wherein said combined 
significance measure is equal to one of the audio significance measure, the video 
significance measure, or an average of said audio and video significance 




measures depending upon the level of audio and video activity. 

20. (Currently Amended) Apparatus for creating a multimedia 
presentation having video frames and video segments with synchronized audio, 
said apparatus comprising: 

input means for receiving audio-video data; 

means for separating said audio-video data into an audio stream and a 
video sequence; 

means for dividing said video sequence into video segments, each of said 
video segments comprising a group of slide frames; 

processor means which, for each of said video s e gm e nt segments , 
calculates an audio significance measure using s«rtd the audio stream related to 
satd each of the video s e gm e nt segments , and, using at least said audio 
significance measure, selects either scrtd each of the video s e gm e nt segments in 
its entity or e xtract extracts at least one slide frame from said corresponding 
group of frames; 

means for synchronizing said audio stream and [said] selected video 
segments and extracted slide frames; and 

means for synchronously reproducing selected video segments and 
extracted slide frames and said audio stream as said multimedia presentation. 

21 . (Currently Amended) Apparatus for creating a multimedia 
presentation having video frames and video segments with synchronized audio, 



comprising: 

means for receiving audio-video data; 

means for separating said audio-video data into an audio stream and a 
video sequence; 

means for dividing said video sequence into video segments, each of said 
video segments comprising a group of slide frames; 

processor means which, for each of said video s e gm e nt segments , 
extracts at least one representative frame from the corresponding sattf group of 
frames, calculates a video significance measure using s*kf the extracted at least 
one fram e s frame , calculates an audio significance measure using said the audio 
stream related to said each of said video s e gm e nt segments , and, using said 
video and audio significance measures, selects either said each of said video 
s e gm e nt segments in its entirely or e xtract extracts at least one slide frame from 
said the group of frames; 

means for synchronizing said audio stream and said selected video 
segments and extracted slide frames; and 

means for synchronously reproducing said the selected video segments 
and the extracted slide frames and said audio stream. 

22. (Currently Amended) A computer program product including a 
computer readable medium incorporating a computer program for creating a 
multimedia presentation having video frames and video segments with 
synchronized audio, said computer program having: 



input code means for receiving audio-video data; 

code means for separating said audio-video data into an audio stream and 
a video sequence; 

code means for dividing said video sequence into video segments, each of 
said video segments comprising a group of slide frames; 

processing code means which, for each of said video s e gm e nt segments , 
calculates an audio significance measure using satef the audio stream related to 
said each of said video s e gm e nt segments , and, using at least said audio 
significance measure, selects either said each of said video s e gm e nt segments in 
its entity or e xtract extracts at least one slide frame from said corresponding 
group of frames; 

code means for synchronizing said audio stream and [said] selected video 
segments and extracted slide frames; and 

code means for synchronously reproducing selected video segments and 
extracted slide frames and said audio stream as said multimedia presentation. 

23. (Currently Amended) A computer program product including a 
computer readable medium incorporating a computer program for creating a 
multimedia presentation having video frames and video segments with 
synchronized audio, said computer program having: 

code means for receiving audio-video data; 

code means for separating said audio-video data into an audio stream and 
a video sequence; 



code means for dividing said video sequence into video segments, each of 
said video segments comprising a group of slide frames; 

processing code means which, for said each of said video s e gm e nt 
segments , extracts at least one representative frame from the corresponding 
said group of frames, calculates a video significance measure using sattf the at 
least one extracted frame frames , calculates an audio significance measure using 
satd the audio stream related to said each of said video s e gm e nt segments , and, 
using said video and audio significance measures, selects either said each of said 
video s e gm e nt segments in its entirety or e xtract extracts at least one slide 
frame from said group of frames; 

code means for synchronizing said audio stream and strtd selected video 
segments and extracted slide frames; and 

code means for synchronously reproducing said the selected video 
segments and extracted slide frames and said audio stream. 
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